ABSTRACT

Cooperative book acquisition plans and improved 
systems of interlibrary lending are intended to eliminate duplication 
of library holdings. In order to calculate possible savings, an 
investigation of how much duplication exists in the libraries on the 
various campuses of the University of California was made. Estimates 
were developed for the extent of the overlap of the monograph 
holdings of the UCLA collection with those of the other southern 
University of California campuses, and of the monograph holdings of 
the Berkeley collection with those of the other northern campuses. 
The historic usage rates of the overlapped portions of the 
collections also were estimated. The methodology by which these 
estimates were obtained is quite general and with appropriate 
modifications should be applicable in investigations of a similar 
nature in other library systems. (Author/PF) 
ABSTRACT



The rising costs of books and book processing have caused many
library policymakers to consider more carefully the advantages of
interlibrary cooperation of various kinds, including cooperative book
acquisition plans and improved systems of interlibrary lending. A key
question that has to be answered before sensible decisions can be made
on these matters is: How much duplication among library collections is
there that such schemes might potentially eliminate? This paper reports
the results of an investigation of this question for the libraries on
the various campuses of the University of California. Estimates are
given for the extent of the overlap of the monograph holdings of the
UCLA collection with those of the other southern University of
California campuses, and of the monograph holdings of the Berkeley
collection with those of the other northern campuses. Estimates of the
historic usage rates of the overlapped portions of the collections are
also given. The methodology by which these estimates were obtained is
quite general and with appropriate modifications should be applicable
in investigations of a similar nature in other library systems.
I. INTRODUCTION



In an attempt to find economies to offset the rising costs of
acquiring, processing, and storing books, library administrators have
of late been giving more serious consideration to the possibility of
cooperative schemes of various sorts which would allow them to share
these costs with other libraries. Prominent among the cost-sharing
schemes are such well-known ideas as cooperative acquisition plans
whereby only, say, one copy of a book need be purchased by a group of
libraries, with accompanying plans for improved interlibrary lending
under which all members of the group would have easy access to that one
copy. But whether substantial economies could be realized under such
arrangements depends at least partly upon the extent of the duplication
in holdings that could thereby be avoided. Is there very much
duplication among the collections of large research libraries? And if
there is, how heavily used are the duplicated documents at their
respective locations? Unless these facts are known it is difficult to
make rational policy decisions about whether cooperative systems for
avoiding duplication are worthwhile.

The purpose of this paper is to report the results of an investi- 
gation into the extent and nature of the duplication of monograph holdings 
among the various University of California libraries. Each of the nine 
campuses of the University of California has its own local library system, 
and the extent to which the monograph collection of any one of these 
local systems overlaps the collections of any other has heretofore been 
a matter of conjecture. The aim of the investigation was to obtain rough 
first estimates of the extent of this duplication, and in addition some 
preliminary idea of how heavily the duplicated portions of the collections 
are used. 

The scope of the investigation was restricted to books or monographs 
as opposed to serials. This restriction was made partly for research con- 
venience, and partly because of the existence of plans for an independent 
investigation of duplication of serial holdings at some future time. 
More importantly, the investigation was limited to fact-finding as opposed 
to policy-making. No attempt was made to draw any conclusions from the 
data about the advisability of interlibrary cooperation or the form such 
cooperation might take, this being a difficult problem which lay beyond 
the immediate mission of the research group. The significance of our 
findings for interlibrary cooperation will be the topic of a later paper. 

Our investigation may be compared in its general intent with
similar studies of collection overlap in six New England state
university libraries,1 in five Washington, D.C., university libraries,2
in six New South Wales libraries,3,4 and in the University of London
libraries.5 The present study differs from these in methodology,
however, combining an investigation of collection overlap with a
followup study of usage rates of the overlapped materials. It is hoped
that, in addition to whatever intrinsic interest the findings
themselves may have, the methodology may be of interest to those who
will have future occasion to make the same kind of investigation
elsewhere.
II. GENERAL METHODOLOGY



Geographically the campuses of the University of California fall
into two distinct clusters, one in the north-central part of the state
and the other in the southern part. The northern cluster consists of
Berkeley, the largest of the nine campuses, and three other smaller
campuses lying respectively north, west, and south of Berkeley at Davis,
San Francisco, and Santa Cruz. The southern cluster consists of the
Los Angeles campus, the second-largest of the nine, surrounded by
smaller campuses at Santa Barbara, San Diego, Riverside, and Irvine.
One of the implications of these circumstances for possible future
interlibrary cooperation is that it might be reasonable to consider
making the library at Berkeley the hub member of a northern library
constellation and the U.C.L.A. library the hub of a southern
constellation, with possible further interaction taking place between
the Berkeley and U.C.L.A. libraries themselves. Other future
arrangements are, of course, possible too, but almost any cooperative
venture that might reasonably be considered is apt to be characterized
at least by substantial interaction between Berkeley and each of the
other northern campuses, and between Los Angeles and each of the other
southern campuses.

These geographical facts suggested a research procedure which would
provide information about the holdings duplicated between the following
pairs of campuses: Berkeley-Davis, Berkeley-Santa Cruz, Berkeley-San
Francisco, Los Angeles-Santa Barbara, Los Angeles-San Diego, Los
Angeles-Riverside, Los Angeles-Irvine, and Los Angeles-Berkeley. A
basic research strategy was therefore settled upon which consisted of
(A) drawing a random sample of monographs from the Berkeley collection
and seeing what proportion of these were to be found also at Davis,
Santa Cruz, and San Francisco; (B) drawing a random sample of
monographs from the Los Angeles collection and finding out the
proportion of these that were also at Santa Barbara, San Diego, etc.;
and (C) finding out what proportion of the Los Angeles sample were also
in the Berkeley collection. From these sample data it was possible to
deduce (to an approximation) the amount of monograph material held in
common between the two campuses of each pair of interest, expressed
either as an absolute number of monograph titles held in common or as a
proportion of total collection size. The compilation and statistical
analysis of these overlap figures completed the first phase of the
project.

The aim of the second phase was to gain some idea of how heavily
the duplicate holdings are used. To this end historical circulation
data were obtained for all monographs in the sample which step (A)
revealed to be held in common between Berkeley and another northern
campus, or which step (B) showed to be shared between Los Angeles and
another southern campus. The circulation data were obtained both for
'hub' campuses and 'outlying' campuses, e.g. for books held by both
Davis and Berkeley the circulation history at Davis as well as the
circulation history at Berkeley was obtained. Although the circulation
data were incomplete in certain respects they were sufficient to make
rough frequency counts of the number of times each duplicated book had
circulated during the five-year period of 1969-73. It was also possible
to make a rough determination for each campus of the proportions of the
monographs that were classifiable as 'high usage', 'low usage', and 'no
usage' books according to an algorithm designed by the State Auditor
for that purpose. Since circulation history does not take in-house
usage into account, the number of times a book has circulated
underestimates to some unknown extent the number of times it has really
been used. Thus the circulation figures are really only lower bounds on
the actual usages. Nevertheless there is evidence to suggest that
circulated usage is at least roughly proportional to true usage and in
this sense can be used as a comparative indicator of it.



III. DEFINITIONS OF THE MONOGRAPH COLLECTIONS SURVEYED



The general research strategy called for the drawing of a random
sample of the monographs in the Berkeley collection and the comparison
of this sample against the monograph holdings of each of the other
northern campuses, the whole operation then to be repeated in the
south. But before this strategy could be implemented it was necessary
to define carefully just what was to be meant by the 'Berkeley
collection', the 'Los Angeles collection', and so forth. The problem is
not trivial; on the Berkeley campus, for example, there are some
twenty-six different local subcollections, not counting a group of
miscellaneous institute and departmental libraries. Which of these
subcollections should the Berkeley sample be drawn from?

In view of the fact that the project findings were to be used partly
as a basis for administrative decisions about cooperative acquisition
and interlibrary loan policies, it seemed reasonable to define the
collections to be sampled in such a way as to coincide as closely as
possible with the set of all monograph titles available or likely to be
available in the future for interlibrary loan. Judgments about the
interlibrary lendability of each subcollection at Berkeley and Los
Angeles were therefore made on the basis of recent annual reports of
those libraries and interviews with library staff members, and it was
decided to draw the samples by sampling from combinations of card
catalogs whose collective contents corresponded reasonably well with
the subcollections judged lendable.

Specifically, the Berkeley sample was drawn from four card catalogs:
(i) the Official Catalog, which is a shelf list of all titles in
Berkeley's central collection (Doe Library) and twenty Berkeley branch
libraries; (ii) the shelf list for the Rowell Collection, which covers
other materials cataloged prior to the University's adoption of the
Library of Congress system; (iii) the shelf list for the Law Library;
and (iv) the Temporary Cataloging Pool catalog, which provides
temporary access to newly acquired documents that have not yet been
fully cataloged. Thus the operative meaning of the term 'Berkeley
collection' in this report is the set of all monographs for which there
is an entry in any of these four catalogs. The 'Berkeley collection' so
defined is thought to correspond moderately closely with the set of all
potentially lendable monographs at Berkeley, though there are important
exceptions: the General Reference collection, for example, is included
in the Official Catalog but does not circulate. Among the Berkeley
holdings intentionally excluded from the Berkeley collection were the
Bancroft Library and Morrison Library collections, which do not
circulate, and the East Asiatic Library, which does not normally
circulate and most of whose catalog cards are in Asian languages. The
circulating Moffitt Undergraduate Library collection, though separately
cataloged, consists almost entirely (96%) of titles included in other
Berkeley libraries, and so is included in the 'Berkeley collection' for
all practical intents and purposes. By volume count, the 'Berkeley
collection' contains some 89% of Berkeley's total holdings before the
exclusion of nonmonographic materials.
The 'Los Angeles collection' was defined to consist of all
monographs for which there is an entry in one of the following catalogs:
(i) the Technical Services Department Shelf List; (ii) the Physical
Science Libraries shelf list; (iii) the Law Library shelf list; and (iv)
the Biomedical Library shelf list. Collectively these four catalogs
cover some twenty collections including, in the University Research
Library, the following collections: Stack, Reference, Public Affairs
Service, [illegible] Arts, and Technical Services; the College Library
and the Education & Psychology collection in the Powell Library
Building; the following branch libraries: Architecture, Art,
Management, Map, and Music; and the following physical science library
collections: Engineering & Mathematical Sciences, Physics, Geology &
Geography, and Chemistry. Not included were the Clark Library, the
English Reading Room, and the University Elementary School collections,
none of which circulates; the Oriental collection; and the so-called
'Brief Listed Volumes' as well as any other uncataloged items. Here
again the collection defined is thought to be in fair, though not
perfect, correspondence with the set of lendable monographs on the Los
Angeles campus.

At the seven 'outlying' campuses there were fewer subcollections
to choose from, hence fewer problems about which to include in the
present study. Briefly, the 'Davis collection' was defined to be the
set of all monographs represented in the main author-title catalog, the
Law Library catalog, and the government documents catalog on the Davis
campus. The 'Santa Cruz collection' was taken to consist of all
monographs represented in the 1971 machine-produced author-title
catalog, its 1972 and 1973 updates, and the in-process file on that
campus. The 'San Francisco collection' was taken to be all monographs
in that campus's author-title catalog. The 'Santa Barbara collection'
was taken to be all monographs represented in the author-title catalog,
the catalog of art exhibition catalogs, and the government documents
catalog. The 'San Diego collection' consisted of all monographs
represented in the author-title catalog, the government documents
catalog, and the in-process file. The 'Riverside collection' consisted
of all monographs represented in the author-title catalog, the catalog
of the Bio-Agricultural Library, and the government documents catalog.
The 'Irvine collection' was defined to include the monographs
represented in the author-title catalog, the catalog of the Medical
Sciences Collection, the government documents catalog, and the
in-process file. For purposes of comparing the Los Angeles collection
against the Berkeley collection only, the Berkeley collection was
redefined to consist of all monographs covered by the main author-title
and government documents catalogs. These choices were to some extent
arbitrary, but some such decisions had to be made for the sake of
definiteness and to indicate clearly the extent of the materials
searched on each campus.

The term 'monograph' was defined to include individually cataloged
issues of monographic serials, but to exclude monographic serials
cataloged only as serials, pamphlets, sheet music, ephemera, and all
nonprint materials such as phonograph records, paintings, and
transparencies. Monographs in microcopy form were excluded from the
Berkeley and Los Angeles collections but included in the collections of
the smaller campuses.



IV. COLLECTION SIZES



With the collections of interest defined clearly, the next step
was to estimate the number of monograph titles in each. The size of
the Berkeley collection was estimated to be 1,452,000 monograph titles,
and the size of the Los Angeles collection was estimated as 1,247,000
monograph titles. These estimates may be compared with the independent
estimates of 1,772,000 titles for Berkeley and 1,305,000 titles for Los
Angeles derived for the comparable collections from data given in
Reference (6), a report prepared in 1973 under the direction of LeRoy
Ortopan of the Berkeley cataloging department which will hereinafter be
referred to as the 'Ortopan report'. A difference of about 7 or 8
percent between the two sets of figures was to be expected because the
estimates in the Ortopan report include serials as well as monographs
while ours do not, and because the Ortopan estimates were based on an
assumption of one title per catalog card while ours included an
adjustment for continuation cards. When these factors are taken into
account our estimates are in reasonably good agreement with the Ortopan
estimates for the Los Angeles campus but are somewhat lower than the
Ortopan figures for Berkeley. We are unable to explain the discrepancy.

Because of the apparent conflict with the Ortopan findings it may
be of interest to include a brief description of the methodology by which
our estimate of the Berkeley collection size was obtained. As already
explained, the Berkeley collection was specified in terms of four card
catalogs. Since these catalogs were shelf lists, none contained more
than one entry per title. Moreover, the four happened to be mutually
exclusive in the sense that no monograph represented in one was
represented in any of the other three. These circumstances made it
feasible to estimate the Berkeley collection size from the size of the
card catalogs. Thanks to the fact that it had recently been
photocopied, the exact number of cards (1,217,793) in the Official
Catalog was known. The total length of the cards in the remaining three
catalogs was measured in centimeters. A random sample of 216
two-centimeter blocks of cards was drawn from the Official, Rowell and
Law catalogs collectively and the number of monograph titles in each
block counted directly. From this sample it was calculated that there
were an average of .936 monograph titles per catalog card and 35.2
monograph titles per centimeter of catalog cards in these three
catalogs. Separate estimates of these factors were made for the
Temporary Cataloging Pool catalog, in which the use of brief listings
and thin-paper cards made it seem likely that both factors would be
substantially different. From these data the size of the subcollection
represented by the Official Catalog was calculated to be about
1,130,000 monograph titles, by the Rowell Catalog 174,000 monograph
titles, by the Law School Catalog 64,000 monograph titles, and by the
Temporary Cataloging Pool 84,000 monograph titles. The standard errors
in those figures introduced by the sampling technique used to get the
conversion factors were 0.38%, 0.37%, and 3.07% respectively. The sum
of the four figures is 1,452,000 monograph titles.

The sizes of the smaller campus collections were not independently
estimated but were instead computed from data given in the Ortopan
report. The computation included an adjustment of the Ortopan figures
for continuation cards and for the exclusion of serials from the
collections of interest. The resulting numbers of monograph titles
estimated to be in each collection were as follows:

     Davis            568,000 titles
     Santa Cruz       265,000
     San Francisco     87,100
     Santa Barbara    588,000
     San Diego        647,000
     Riverside        403,000
     Irvine           425,000



No error estimates are available for these figures.



V. SAMPLING PROCEDURES 



The sizes of the random samples to be drawn from the Berkeley and
Los Angeles collections were determined by the width of the confidence
intervals that could be tolerated in the final overlap estimates. It
was arbitrarily decided that for this study a .95 confidence interval
would be acceptable if it extended no more than about three percentage
points in either direction from the estimated percentage overlap
figure. If, for example, 20% of the Berkeley collection were estimated
to be duplicated at Davis, the confidence interval was to be deemed
narrow enough if it extended from 17% to 23%. These are fairly wide
confidence intervals, as statistical studies go, but they were felt to
be narrow enough to give at least a rough idea of the overlap, which
was all that was to be attempted in this preliminary study. To make
them much narrower would have required a disproportionately larger
sample; a rule of thumb is that to cut the width of a confidence
interval in half one must quadruple the sample size. Substantially
narrower confidence intervals would therefore have been beyond the
resources made available to us for the study.
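
The trade-off between sample size and interval width behind this rule of thumb can be illustrated with the usual normal approximation for a proportion. The sketch below is an assumption on our part (the study itself worked from a statistical chart), and the function name is hypothetical.

```python
import math

Z_95 = 1.96  # normal quantile for a two-sided .95 interval


def half_width(p, n, z=Z_95):
    """Approximate half-width of a confidence interval for a proportion."""
    return z * math.sqrt(p * (1 - p) / n)


# A 20% overlap estimated from 1,000 titles versus 4,000 titles.
w_1000 = half_width(0.20, 1000)  # roughly 2.5 percentage points
w_4000 = half_width(0.20, 4000)  # half of w_1000
```

Because the half-width varies as one over the square root of the sample size, quadrupling the sample is what it takes to halve the interval.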

A word is in order here about confidence intervals in general,
since the correct interpretation of all the findings to be given in the
sequel requires a clear understanding of what a confidence interval is.
A .95 confidence interval for an estimate is a pair of numbers defining
an interval around the estimate such that, roughly speaking, the true
figure of interest has a 95% chance of lying in the interval. (A more
exact definition is that for any point lying outside the interval, if
that point had been the true figure of interest the chances would have
been less than five in one hundred of obtaining an estimate as far away
from the point as the actual estimate lay.) Thus a .95 confidence
interval for an overlap estimate that stretches from 17% to 23%
indicates that the true overlap very likely lies between 17% and 23%.
When the confidence intervals are wide they take on more significance
than the estimate itself.

It is a common misconception about statistical methodology that if
one is sampling from a very large population, a very large sample should
be necessary. This idea is mistaken, because after a certain point the
population size has little effect on the size of the sample needed.
Sample sizes are therefore to be decided upon with reference to the
statistical uncertainty in the estimate that is tolerable, as indicated
e.g. by the confidence interval, and not on the basis of population size.
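
The point can be checked numerically with the standard finite population correction for sampling without replacement. This is a sketch under our own assumptions (normal approximation, hypothetical function name, illustrative population sizes apart from the 1,452,000 Berkeley estimate); it is not a calculation reported in the study.

```python
import math


def half_width_fpc(p, n, population, z=1.96):
    """Half-width for a proportion estimated from a simple random sample
    of size n drawn without replacement from a finite population."""
    fpc = math.sqrt((population - n) / (population - 1))
    return z * math.sqrt(p * (1 - p) / n) * fpc


# Same sample size (1,000) and overlap (20%), very different populations.
for size in (10_000, 100_000, 1_452_000, 100_000_000):
    print(size, round(half_width_fpc(0.20, 1000, size), 4))
```

Once the population is a hundred or more times the sample, the printed half-widths are nearly identical, which is why the sample size is chosen from the tolerable uncertainty and not from the collection size.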

With the aid of a statistical chart,7 it was determined that a
sample of about one thousand titles would yield confidence intervals of
the desired width. (The actual sample sizes drawn were 1024 titles
from Berkeley and 1003 from Los Angeles.) The proportion of the sample
to be drawn from each subcollection was determined in accordance with the
relative size of the subcollection, e.g. at Berkeley the number of
titles drawn from the Official Catalog was that proportion of 1024 which
corresponds to the proportion of the Berkeley collection comprised in
the Official Catalog. Within each subcollection the selection of
titles was random. This procedure is known as 'stratified sampling'
and is slightly superior in some respects to sampling randomly from the
pooled catalogs without distinguishing among them.
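
Proportional allocation of this kind can be sketched as below. The function is hypothetical, and the stratum sizes are the final Section IV estimates used purely for illustration; the text does not report how many titles were actually drawn from each catalog, and the largest-remainder rounding is our own choice.

```python
def proportional_allocation(stratum_sizes, total_sample):
    """Split total_sample across strata in proportion to stratum size,
    using largest remainders so that the parts add up exactly."""
    population = sum(stratum_sizes.values())
    quotas = {k: total_sample * v // population
              for k, v in stratum_sizes.items()}
    remainders = {k: total_sample * v % population
                  for k, v in stratum_sizes.items()}
    shortfall = total_sample - sum(quotas.values())
    # Hand the leftover units to the strata with the largest remainders.
    for k in sorted(remainders, key=remainders.get, reverse=True)[:shortfall]:
        quotas[k] += 1
    return quotas


berkeley_strata = {
    "Official Catalog": 1_130_000,
    "Rowell Catalog": 174_000,
    "Law Library Catalog": 64_000,
    "Temporary Cataloging Pool": 84_000,
}
allocation = proportional_allocation(berkeley_strata, 1024)
```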

The actual sampling mechanism involved selecting two hundred and
sixty catalog drawers on each campus with the aid of a random number
table. Within each drawer a random number of centimeters was measured
in, the number again being determined from a random number table, and
the first complete monograph entry following that point was taken as a
sample title. Three more titles from each drawer were obtained in a
similar way by measuring in additional one-centimeter distances,
returning to the front of the drawer if the cards ran out. The cards so
obtained were then xeroxed to make up a record of the sample. It was
necessary later to discard a few titles from each sample which turned
out not to be monograph titles in the preestablished sense.
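
A simulation of this drawer-and-offset mechanism might look like the sketch below. Several details are assumptions, since the text does not specify them: drawers are chosen without replacement, the starting offset is drawn uniformly along the drawer, and the drawer lengths are invented inputs. A pseudo-random generator stands in for the random number table.

```python
import random


def draw_sample_positions(drawer_lengths_cm, n_drawers=260,
                          titles_per_drawer=4, step_cm=1.0, seed=None):
    """Return (drawer_index, offset_cm) pairs marking where to pick titles.

    The first complete monograph entry after each offset would be taken
    as a sample title.
    """
    rng = random.Random(seed)
    chosen = rng.sample(range(len(drawer_lengths_cm)), n_drawers)
    positions = []
    for drawer in chosen:
        length = drawer_lengths_cm[drawer]
        offset = rng.uniform(0, length)
        for i in range(titles_per_drawer):
            # Wrap to the front of the drawer if the cards run out.
            positions.append((drawer, (offset + i * step_cm) % length))
    return positions


# Hypothetical catalog of 1,000 drawers, each holding 40 cm of cards.
positions = draw_sample_positions([40.0] * 1000, seed=1)
```

Four titles from each of 260 drawers gives 1,040 candidate titles, which is consistent with final samples of 1024 and 1003 after a few non-monograph entries were discarded.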



VI. SEARCH PROCEDURE 



The next step in the data-gathering process was the searching of
the catalogs at the smaller campuses for each title in the sample. The
Berkeley sample was searched for in the Davis, Santa Cruz, and San
Francisco collections, and the Los Angeles sample was searched for at
Santa Barbara, San Diego, Riverside, Irvine, and Berkeley.

At Davis, the first of the smaller campuses to be visited, the
entire Berkeley sample was searched under all possible entries, but this
procedure proved costly and produced only an insignificant number of
titles that would not have been found by looking up main entries only.
Thereafter all but a few selected sample titles were looked up by main
entry only.

If a document could be found in the smaller collection which was
an exact bibliographic duplication of a document in the Berkeley or Los
Angeles sample, it was recorded that the smaller collection contained an
exact match of the sample item. Documents which were exact reprints of
sample items or differed from them only in place of printing were also
counted as exact matches. Documents which were textually similar to
sample items but which differed from them in respect to edition,
publisher, publication date, editor, compiler, translator, etc., were
recorded as approximate matches of the sample item. The same treatment
was accorded to microcopy editions and partial duplication of
multi-volume sets. This criterion for approximate match, though
somewhat arbitrary, is at least quite definite. The motivating idea
behind the criterion is that a campus collection should be recorded as
containing an approximate match for a sample title provided it
contained a document that stood a reasonably good chance of filling a
user's need, even though the user might have requested the item in the
precise form in which it existed in the sample. A translation of a work
into another language was not counted as even an approximate match.
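
The matching rules above can be restated as a small decision procedure. The record fields and function below are hypothetical simplifications introduced for illustration: searchers worked from catalog cards, not structured records, and partial duplication of multi-volume sets is left out of this sketch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    work: str                     # identifies the underlying text
    language: str
    edition: str
    publisher: str
    year: int
    place: str = ""
    microcopy: bool = False
    exact_reprint: bool = False   # judged by the searcher


def classify(sample: Record, found: Record) -> str:
    """Return 'exact', 'approximate', or 'none' for a candidate match."""
    if sample.work != found.work or sample.language != found.language:
        return "none"          # different text, or a translation
    if found.microcopy:
        return "approximate"   # microcopy editions count as approximate
    same_issue = (sample.edition, sample.publisher, sample.year) == (
        found.edition, found.publisher, found.year)
    if same_issue or found.exact_reprint:
        return "exact"         # place of printing may differ
    return "approximate"       # differs in edition, publisher, date, etc.
```

Under this scheme every exact match is also an approximate match in the sense used for the overlap figures, so an approximate-match count would include both labels.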
VII. FINDINGS CONCERNING EXTENT OF DUPLICATION



Tables IA-IC present the results of the investigation with respect
to the extent of overlap between the monograph collections of the
various campuses. (In these and all tables to follow, an 'A' in the
table number indicates comparisons among northern campuses, a 'B'
comparisons among southern campuses, and a 'C' a comparison between the
Berkeley and Los Angeles campuses.) In column (1) of Table IA, for
instance, we see that 25.6% of the documents in the Berkeley sample had
approximate duplicates in the Davis collection, resulting in an estimate
of 25.6% for the proportion of the entire Berkeley collection that is
approximately duplicated at Davis. The confidence interval of 23.0% -
28.4% that accompanies this estimate indicates that, due to the
statistical uncertainty introduced by taking a sample of only about a
thousand monographs, we can be reasonably sure only that the true
percentage of the Berkeley collection approximately duplicated at Davis
falls within that range. These approximate match figures are to be
understood as including exact matches too. When attention is
restricted to exact matches only, the extent of the duplication
shrinks, and is indicated by the accompanying italicized figures found
below the approximate match figures. We see that about 19.4%, or
between 17.0% and 22.1%, of the Berkeley collection is exactly
duplicated at Davis.
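
For readers who wish to reproduce an interval of this kind without a chart, one common choice is the Wilson score interval. The study does not say which method its chart embodied, so the sketch below is a substitute technique and the function name is hypothetical.

```python
import math


def wilson_interval(p_hat, n, z=1.96):
    """Wilson score interval for a binomial proportion."""
    denom = 1 + z * z / n
    center = (p_hat + z * z / (2 * n)) / denom
    half = z * math.sqrt(p_hat * (1 - p_hat) / n
                         + z * z / (4 * n * n)) / denom
    return center - half, center + half


# Approximate-match share at Davis, from a Berkeley sample of 1024 titles.
low, high = wilson_interval(0.256, 1024)
```

For the Davis approximate-match figure this gives roughly 23.0% to 28.4%, in line with the interval quoted from Table IA; for other entries the chart values may differ slightly from what this formula produces.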

The bottom rows of Tables IA and IB tell the extent of the hub
collection duplicated at one or more outlying campuses. For example,
Table IA states that for about 30.9% of the monographs in the Berkeley
collection an approximate duplicate can be found on at least one other
northern campus.
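
The 'one or more campuses' figure is the share of sample titles matched anywhere, which is not the sum of the per-campus shares because a title may be held at several campuses. A minimal sketch, with an invented ten-title sample and a hypothetical function name:

```python
def share_matched_anywhere(matches_by_campus, sample_size):
    """Fraction of sample titles matched at one or more campuses."""
    matched_anywhere = set().union(*matches_by_campus.values())
    return len(matched_anywhere) / sample_size


# Toy data: identifiers of sample titles found at each outlying campus.
toy_matches = {
    "Davis": {1, 2, 3},
    "Santa Cruz": {2, 4},
    "San Francisco": {5},
}
toy_share = share_matched_anywhere(toy_matches, sample_size=10)
```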

Column (2) of Tables IA-IC translates the percentages in column (1)
into absolute numbers of titles in the overlap. There are, for example,
about 372,000 monographs held in common between the Berkeley and Davis
campuses, when the criterion of duplication is the approximate match.
It is the computation of the figures in column (2) for which the
estimates of the Berkeley and Los Angeles collection sizes were
required. The confidence intervals that accompany the estimates in
column (2) were computed in such a way as to take into account only the
statistical uncertainty that is due to the sample size of about one
thousand; they do not reflect the uncertainty in the estimates of the
Berkeley and Los Angeles collection sizes and so are slightly narrower
than they should be.
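
The conversion from column (1) to column (2) is a simple scaling by the estimated collection size, as the following sketch shows for the Berkeley-Davis figures quoted in the text. The function name is hypothetical, and the interval endpoints are scaled as if the collection size were known exactly, which is the simplification the text describes.

```python
BERKELEY_TITLES = 1_452_000  # estimated collection size from Section IV


def titles_in_overlap(share, collection_size=BERKELEY_TITLES):
    """Convert an overlap proportion into an absolute number of titles."""
    return share * collection_size


estimate = titles_in_overlap(0.256)  # about 372,000 titles
low, high = titles_in_overlap(0.230), titles_in_overlap(0.284)
```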

In column (3) the figures of columns (1) and (2) are presented in
still another way as estimates of the proportions of the outlying campus
collections that are duplicated at a hub campus. Because error estimates
are not available for the sizes of the collections at the smaller
campuses, no confidence intervals were computed for the estimates in
column (3). The figures in column (3) are the least reliable of any on
the chart and should be regarded as rough approximations only.
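
As a worked instance of the column (3) idea, the overlap count can be divided by the size of the outlying collection. The inputs below are the Berkeley-Davis overlap and the Davis collection size quoted earlier in the text; the resulting ratio is our own arithmetic and is not a value read from the table.

```python
def share_of_outlying_duplicated(overlap_titles, outlying_titles):
    """Proportion of an outlying collection that is also held at the hub."""
    return overlap_titles / outlying_titles


# About 372,000 shared titles against a Davis collection of 568,000.
davis_share = share_of_outlying_duplicated(372_000, 568_000)
```

On these inputs roughly two-thirds of the Davis collection would be approximately duplicated at Berkeley, subject to the unquantified error in the Davis collection size.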
VIII. CIRCULATION HISTORIES



The purpose of the second phase of the investigation was to obtain
some indication of the amount of usage which the duplicated materials
typically receive. For this purpose circulation history as recorded on
the charge slip in each book was taken as an indicator of usage.
Obviously circulation history does not fully describe or measure a
book's true usage, which includes library or 'in-house' uses as well as
'extra-house' uses. Nevertheless, recent studies by Pinzelik and
Tolliver,8 McGrath,9 Fussler and Simon,10 Morse,11 and a report by the
State Department of Finance12 have shown that those books which tend to
be used in-house are also those books which tend to circulate most
frequently. Thus circulation records do provide some definite and
measurable basis for making judgments about usage, even though total
usage may far exceed circulated usage alone.



To find out the circulation histories of interest each monograph
title that was found in the first phase of the investigation to be shared
by two campuses was located in the stacks of both campuses in question
and the total number of charges indicated on the charge slips was
recorded by year. Monographs which could not be located in the stacks
were recalled. If there was found to be more than one copy of a
monograph of interest on a campus, the number of charges was averaged
for all copies up to a limit of three copies. In cases where there were
more than three copies that were approximate matches of a sample title,
the three chosen for the purpose of recording the circulation data were
those judged to be the 'closest' approximations to the title in the
sample. A special note was made of books in which the circulation record
appeared to be incomplete, e.g. books in which old circulation slips
had been torn out.
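
The averaging rule for multiple copies can be sketched as follows. The data layout (one year-to-charges mapping per copy, ordered closest match first) and the function name are assumptions made for illustration, as is the treatment of a year absent from a slip as zero charges.

```python
from collections import defaultdict


def average_charges_by_year(copies, max_copies=3):
    """Average yearly charge counts over at most max_copies copies.

    copies: list of {year: charges} dicts, ordered closest match first.
    """
    used = copies[:max_copies]
    totals = defaultdict(int)
    for slip in used:
        for year, charges in slip.items():
            totals[year] += charges
    # A year missing from a slip contributes zero charges for that copy.
    return {year: total / len(used) for year, total in sorted(totals.items())}


# Four copies on one campus; only the first three are used.
example = average_charges_by_year([
    {1969: 2, 1970: 1},
    {1969: 0, 1970: 3},
    {1969: 1},
    {1969: 9},
])
```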
IX. FINDINGS CONCERNING USAGE OF DUPLICATED MONOGRAPHS



The term 'usage' will be used from now on in the sense of
'circulated usage', i.e. the number of times a book has circulated. We will
present first the findings concerning usage on the Berkeley and Los
Angeles campuses of monographs duplicated elsewhere. Tables IIA and IIB
give estimates of the breakdown of the duplicated monographs according
to circulation status and availability of circulation history on the
Berkeley and Los Angeles campuses. We see from column (1) of Table IIA,
for example, that of all the monographs in the Berkeley sample that had
exact or approximate duplicates at Davis, 81.9% were in one of Berkeley's
circulating collections and had circulation records that were intact
and accessible to the investigators. This is the proportion of the
duplicated sample books on which the Berkeley usage figures to follow are
based. The accompanying confidence interval shows the probable range
of the percentage of such books in the entire Berkeley collection.
Column (2) shows the proportion of the duplicated monographs which
circulate but for which the circulation history could not be determined,
either because of lost charge slips or because the book in question was
not in the stacks and attempts to recall it were unsuccessful. Column
(3) indicates the proportion of the duplicated monographs which do not
circulate. One sees, e.g., that of all monographs in the Berkeley
collection that have exact or approximate matches at Davis, between 2.4% and
7.7% do not circulate at Berkeley.

Confining attention now to circulating monographs at Berkeley and
Los Angeles that are duplicated elsewhere and have known usage histories
(i.e. confining attention to the monographs represented in column (1)
of Tables IIA and IIB), we may ask how many times such books were charged
out at Berkeley or Los Angeles during the last five years. This question
is answered in Tables IIIA and IIIB. The first row of Table IIIA shows,
for example, that 36.9% of such books did not circulate at all from 1969
to 1973, 26.1% were charged out only once or twice, 26.1% were charged
out from three to ten times, and 10.8% were charged out more than ten
times. (The number of books charged out more than twenty times, though
not shown separately in the tables, was in all cases very small.) The
.95 confidence intervals which accompany these figures indicate the
probable ranges of the corresponding percentages in the relevant portions
of the entire Berkeley collection.
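
As an illustration of how figures of this kind can be produced, the
following sketch sorts five-year charge counts into the four usage
classes used in Tables IIIA and IIIB and attaches a .95 confidence
interval to each percentage. The report does not state which interval
formula was used; the normal approximation to the binomial shown here,
the assumption of a simple random sample, the function names and the
sample counts are all assumptions made for illustration.

```python
import math


def usage_class(charges):
    """Assign a five-year charge count to one of the four table classes."""
    if charges == 0:
        return "none"
    if charges <= 2:
        return "1-2"
    if charges <= 10:
        return "3-10"
    return "over 10"


def proportion_interval(successes, n, z=1.96):
    """Normal-approximation .95 interval for a sample proportion.

    This is an assumed method, not one confirmed by the report.
    """
    p = successes / n
    half_width = z * math.sqrt(p * (1 - p) / n)
    return max(0.0, p - half_width), min(1.0, p + half_width)


def tabulate(charge_counts):
    """Return {class: (proportion, (low, high))} for a list of counts."""
    n = len(charge_counts)
    tally = {"none": 0, "1-2": 0, "3-10": 0, "over 10": 0}
    for charges in charge_counts:
        tally[usage_class(charges)] += 1
    return {
        label: (count / n, proportion_interval(count, n))
        for label, count in tally.items()
    }


# Hypothetical five-year charge counts for ten sample books.
sample_counts = [0, 0, 0, 0, 1, 2, 2, 3, 5, 12]
table = tabulate(sample_counts)
for label, (share, (low, high)) in table.items():
    print(f"{label:8s} {share:6.1%}  ({low:.1%} to {high:.1%})")
```

With samples as small as the one above the normal approximation is
crude; it is shown only because it is the simplest interval to state.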

In interpreting the usage figures in Tables IIIA and IIIB the reader
should be aware of a methodological problem which stems from the fact
that there is generally no way to tell just by looking at a book exactly
when the library acquired it. Ideally these tables would have been
compiled so as to show the five-year circulation count only for monographs
which had been on the shelves for the entire five-year period, but it
was not always possible to tell which these were. The makeshift
procedure which was followed was to exclude from the analysis all monographs
with imprint dates of 1969 or later; these were discarded from the sample
on the grounds that they were almost certain not to have been acquired
until after the five-year period had commenced. This removed a large
part of the problem but not all of it; presumably there were some monographs
in the samples whose dates of printing antedated 1969 but which were not
actually acquired by the Berkeley or Los Angeles libraries until after
1969. It is evident that the latter are in a minority, though, since at
Berkeley some 89.2% of the monographs remaining in the sample after the
exclusion of the post-1968 books were charged out at least once in 1969,
proving that at least these must already have been on the shelves at
that time; the corresponding figure for Los Angeles is [illegible]. We
conclude that the usage frequencies shown in Tables IIIA and IIIB
understate the true five-year circulation rates somewhat, but probably only
slightly.
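
The exclusion rule and the check on it described above can be sketched
as follows. The record layout (an imprint year plus a year-to-charges
mapping), the function names and the sample records are hypothetical;
only the cutoff year and the logic of the check come from the text.

```python
PERIOD_START = 1969  # first year of the five-year period, per the text


def exclude_recent_imprints(books, cutoff=PERIOD_START):
    """Drop monographs with imprint dates of `cutoff` or later."""
    return [book for book in books if book["imprint_year"] < cutoff]


def share_charged_in(books, year):
    """Proportion of books charged out at least once in `year`."""
    if not books:
        return 0.0
    charged = sum(
        1 for book in books if book["charges_by_year"].get(year, 0) > 0
    )
    return charged / len(books)


# Hypothetical sample records.
sample_books = [
    {"imprint_year": 1955, "charges_by_year": {1969: 1, 1971: 2}},
    {"imprint_year": 1962, "charges_by_year": {1969: 3}},
    {"imprint_year": 1967, "charges_by_year": {1972: 1}},
    {"imprint_year": 1970, "charges_by_year": {1971: 4}},  # excluded
]
retained = exclude_recent_imprints(sample_books)
share_1969 = share_charged_in(retained, PERIOD_START)
print(len(retained), round(share_1969, 3))
```

A book charged out in the first year of the period must have been on
the shelves by then, so this share is a lower bound on the proportion
of retained books available from the start of the period.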

In Tables IVA and IVB the usage rates at Berkeley and Los Angeles
of all circulatable, known-usage monographs duplicated elsewhere (again
the populations of column (1) in Tables IIA and IIB) are analyzed in a
different way. Instead of a straight circulation frequency count over
a fixed time interval, these tables classify the usage pattern over the
entire active life of the book according to a special algorithm designed
by the office of the State Auditor of California. This algorithm is
presented in the Appendix in the form of a flowchart; an explanation
of it may be found in Reference (12). A convenient feature of the
algorithm is that it can be applied to any book whose complete
circulation history is available even though the date on which the book was
acquired may not be known. Hence no special methodological problems
connected with acquisition data arise in connection with these tables,
though of course the reader must familiarize himself with the algorithm
before the classifications 'high usage' and 'infrequent usage' can be
interpreted meaningfully.

Finally we turn to the usage data collected at the smaller
campuses for monographs held in common between these campuses and either
Berkeley or Los Angeles. Tables VA and VB, which are analogous to IIA
and IIB, show in column (1) the proportions of the duplicated monographs
which circulate at the smaller campus in question and whose complete
usage history there could be determined. For these books Tables VIA
and VIB, which are comparable to IIIA and IIIB, give frequency counts
of local circulations over the last five years; again books with imprint
dates of 1969 or later were excluded from the count. Frequency counts
for the San Diego collection are omitted from Table VIB due to the
fact that on that campus circulation records were not kept for many
of the local collections until well after the five-year period had
started. San Francisco circulation data are also omitted. Tables
VIIA and VIIB, which complement IVA and IVB, re-analyze the usage
rates in accordance with the State Auditor's algorithm. The usage
figures for San Diego in Table VIIB must be interpreted with special
care in view of the very brief recorded circulation histories on which
the figures are based.
X. FUTURE RESEARCH POSSIBILITIES



During the entire investigation, the data were recorded in such
a way as to allow for possible future analyses in case at some future
time further analysis should be deemed desirable. The possibilities
include enlargement of the samples, a finer analysis of overlap and usage
characteristics and an investigation of such special characteristics
of the overlapped documents as age or language.